eBPF para profiling continuo: Parca y Beyla

eBPF-based continuous profiling transforms debugging desde “ad-hoc pprof cuando fails” a “profile sempre-on.” Stack moderno 2024: Parca, Grafana Beyla, Pyroscope (ahora Grafana). Este artículo cubre complementary roles + integration patterns.

El stack

Three tools complementary:

Parca: CPU profiling 24/7 via eBPF.
Beyla: auto-instrumentation HTTP + metrics + traces.
Pyroscope: language-specific deep profiling.

Combined: cluster-wide coverage + targeted depth.

Parca: what it covers

Installs DaemonSet, eBPF agent per node:

CPU flame graphs cluster-wide.
Compiled languages: Go, Rust, C++ (deep stack unwinding).
Interpreted: Python, Node (interpreter frames).
Prometheus-compatible: metrics + labels.
Minimal overhead: 0.5-1% CPU.

Beyla: auto-instrumentation

Covers:

HTTP/gRPC traces without SDK changes.
RED metrics (rate, errors, duration).
Service graphs automatic.
OTel output standard.

Not profiling per se — instrumentation sin code changes.

Pyroscope

Originally independent, now part Grafana:

Language agents: native libraries, deeper info.
Compiled + interpreted: richer data.
Storage/querying backend: can ingest de Parca.

Use case: deeper language-specific profiling específico apps critical.

Install

Parca

helm install parca parca/parca
helm install parca-agent parca/parca-agent --namespace monitoring

Agent per node, Parca server centralized. Auto-discovers processes.

Beyla

helm install beyla grafana/beyla \
  --set env.BEYLA_AUTO_INSTRUMENT_TARGET=my-app

Pyroscope

helm install pyroscope grafana/pyroscope

Grafana integration

Unified stack:

Grafana Tempo: traces.
Grafana Mimir: metrics.
Grafana Loki: logs.
Grafana Pyroscope: profiles (+ Parca ingestion).
Grafana: single dashboard.

Click trace → see profile de that period. Correlation fuerte.

Overhead real

Medido en production:

Parca: ~0.5-1% CPU per node.
Beyla: ~1-2% CPU, 50MB memory.
Pyroscope language agent: 1-5% depending.

Combined <5% — acceptable.

Use cases

Detect regressions

Deploy new version → Parca shows increased CPU in specific function → rollback before user impact.

Optimize hotspots

Weekly review flame graphs → identify top-10 CPU consumers → optimize.

Compare periods

“Why is latency up this week?” → compare profiles last week vs this week.

Root cause incidents

Post-mortem: profile during incident vs baseline highlights specific function.

Flame graph interpretation

Basics:

Width = CPU time.
Stack from bottom up.
Hot paths = wide boxes top-level.

Skill — learn con Brendan Gregg tutorials.

Language specifics

Go

Parca excellent: DWARF info, full stacks. Preferred.

Rust

Similar Go — compiled, rich stacks.

Python/Node

Interpreter frames visible. For Python-level detail, Pyroscope language agent mejor.

Java

JVM complex. Specialized Pyroscope Java agent.

C/C++

Native — Parca perfect.

Cost

Storage: profile samples compressed ~1GB/día para 50-node cluster.
S3 or similar para long-term.
Retention: 15-30 días typical.

Manageable.

Alternatives

Datadog Continuous Profiler: commercial, mature, multi-language.
New Relic: similar.
Elastic Profiling: newer.
Cloudflare AMP: specific context.

Open-source stack Parca/Beyla/Pyroscope comparable features + free.

Security

eBPF-based tools require:

Privileged container.
HostPID / HostNetwork.
Kernel sufficiently recent (5.10+).

Attack surface consideration. RBAC restrict.

For new adopters

Start path:

Install Parca en staging.
Review flame graphs semanas.
Skill build interpreting profiles.
Deploy production careful.
Correlate con traces/logs setup.

Skill + value accumulate.

When profiling not useful

No performance issues — waste effort.
Simple workloads — htop suffices.
No expertise interpreting — data sin action.

Most production systems benefit; some don’t.

Conclusión

Continuous profiling via eBPF es significant improvement sobre ad-hoc pprof. Parca + Beyla + Pyroscope stack covers spectrum sin big commercial cost. For performance-sensitive ops, invest. Skill in flame graph interpretation pays. Integration con Grafana stack seamless. Modern production observability increasingly includes profiling como fourth pillar (después metrics/logs/traces). Ahora is good time para adopt.

Síguenos en jacar.es para más sobre eBPF, profiling y observability stacks.

El stack

Parca: what it covers

Beyla: auto-instrumentation

Pyroscope

Install

Parca

Beyla

Pyroscope

Grafana integration

Overhead real

Use cases

Detect regressions

Optimize hotspots

Compare periods

Root cause incidents

Flame graph interpretation

Language specifics

Go

Rust

Python/Node

Java

C/C++

Cost

Alternatives

Security

For new adopters

When profiling not useful

Conclusión

Entradas relacionadas

WASI preview 3: hilos y async en WebAssembly

Llama 3.2 en el edge: Meta apuesta por lo pequeño

Cloudflare Workers en 2025: del edge a la empresa